Minimizing Image or Video Annotation Effort Using Effective Relevance Models

نویسندگان

  • Anton Leuski
  • George E. P. Box
چکیده

Humans primarily resort to text or words to describe or present their thoughts when concerning themselves with various forms of multimedia content, such as visual (images or videos) or acoustic (music, etc.). Thus the task of associating text descriptions with images or videos is germane to any human-interpretable discourse around visual multimedia data. Therefore, the multimedia community has focused its attention towards two, in some sense inverse problems, related to the task of associating text with non-text multimedia content (such as images or videos). They are annotation and retrieval. In the former, the goal is to associate text descriptions with the content while in the latter the goal is to retrieve a set of relevant videos or images corresponding to a given text query. This work proposes a novel statistical approach which attempts to address these two questions, also considering the trade-off between the effectiveness of the algorithm and the associated cost of requiring a sizable annotated training data. This effort is directed towards closing the “semantic gap” between representations extracted from images or videos and their corresponding textual descriptions, in an efficient fashion. The traditional image or video annotation or retrieval models rely on the principle of query-likelihood. In such a setting, every document (i.e. the image or video) is associated with a text description. The query-likelihood model assumes the text of the document to have been generated by some underlying probability distribution and the query to have been sampled from that. Such models however, require a sizable amount of training data for a good performance. In this work, we attempt to address this lacunae. We resort to active learning for this. In an active learning setting, using a measure of sample informativeness, we determine an order for labeling the samples in the training data. This order allows us to achieve a good performance even without requiring the entire training set to be labeled. Finally we also propose a novel approach for retrieval of multimedia content, which may be easily extended for annotation as well. Our approach deviates from traditional approaches, since we do not adopt the query-likelihood model for addressing this question. Instead our model, assumes the query and the document to be coming from two different distributions, which are then compared to obtain a measure of relevance. This allows our model more flexibility, in modeling the user’s information needs. The proposed model, is then adapted in the active learning setting described above to achieve efficient and effective retrieval.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Survey on Interactive Video Retrieval Using Active Learning Approach

Active learning is a machine learning technique which chooses the most informative models for labelling and uses them as training data. It has been extensively explored in multimedia research area for reducing human annotation effort. In this article, efforts of active learning in multimedia annotation and retrieval have been surveyed .The application domains such as image or video annotation, ...

متن کامل

Fuzzy Neighbor Voting for Automatic Image Annotation

With quick development of digital images and the availability of imaging tools, massive amounts of images are created. Therefore, efficient management and suitable retrieval, especially by computers, is one of themost challenging fields in image processing. Automatic image annotation (AIA) or refers to attaching words, keywords or comments to an image or to a selected part of it. In this paper,...

متن کامل

Trans Media Relevance Feedback for Image Autoannotation

Automatic image annotation is an important tool for keyword-based image retrieval, providing a textual index for non-annotated images. Many image auto annotation methods are based on visual similarity between images to be annotated and images in a training corpus. The annotations of the most similar training images are transferred to the image to be annotated. In this paper we consider using al...

متن کامل

Tags Re-ranking Using Multi-level Features in Automatic Image Annotation

Automatic image annotation is a process in which computer systems automatically assign the textual tags related with visual content to a query image. In most cases, inappropriate tags generated by the users as well as the images without any tags among the challenges available in this field have a negative effect on the query's result. In this paper, a new method is presented for automatic image...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015